Fast and Accurate Random Walk with Restart on Dynamic Graphs with Guarantees
Abstract
Given a time-evolving graph, how can we track similarity between nodes in a fast and accurate way, with theoretical guarantees on the convergence and the error? Random Walk with Restart (RWR) is a popular measure to estimate the similarity between nodes and has been exploited in numerous applications. Many real-world graphs are dynamic, with frequent insertion/deletion of edges; thus, efficiently tracking RWR scores on dynamic graphs has attracted much interest among data mining researchers. Recently, dynamic RWR models based on the propagation of scores across a given graph have been proposed, and they have outperformed previous approaches to computing RWR dynamically. However, those models fail to guarantee exactness and convergence time for updating RWR in a generalized form. In this paper, we propose OSP, a fast and accurate algorithm for computing dynamic RWR with insertion/deletion of nodes/edges in a directed/undirected graph. When the graph is updated, OSP first calculates offset scores around the modified edges, propagates the offset scores across the updated graph, and then merges them with the current RWR scores to get updated RWR scores. We prove the exactness of OSP and introduce OSP-T, a version of OSP which regulates a trade-off between accuracy and computation time by using an error tolerance ε. Given restart probability c, OSP-T guarantees to return RWR scores with O(ε/c) error in O(log_{1−c}(ε/2)) iterations. Through extensive experiments, we show that OSP tracks RWR exactly up to 4605× faster than an existing static RWR method on dynamic graphs, and OSP-T requires up to 15× less time with 730× lower L1 norm error and 3.3× lower rank error than other state-of-the-art dynamic RWR methods.
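The three steps in the abstract (compute offset scores, propagate them on the updated graph, merge with the current scores) can be illustrated with a small NumPy sketch. This is a minimal sketch under stated assumptions, not the authors' implementation: it assumes the standard RWR equation r = (1 − c)·Aᵀr + c·q with a row-normalized adjacency matrix A, and it derives the offset seed directly from that equation. The function names (`row_normalize`, `propagate`, `rwr_static`, `rwr_update`) and the L1 stopping rule controlled by `tol` are hypothetical choices made for illustration; `tol` only loosely mirrors the role of the error tolerance ε in OSP-T.

```python
import numpy as np

def row_normalize(adj):
    """Row-normalize an adjacency matrix; rows without out-edges stay zero."""
    adj = np.asarray(adj, dtype=float)
    deg = adj.sum(axis=1, keepdims=True)
    return np.divide(adj, deg, out=np.zeros_like(adj), where=deg > 0)

def propagate(trans, seed, c, tol=1e-12, max_iter=10_000):
    """Accumulate seed + M seed + M^2 seed + ... with M = (1 - c) * trans.T."""
    x = seed.copy()
    total = seed.copy()
    for _ in range(max_iter):
        x = (1.0 - c) * (trans.T @ x)
        total += x
        if np.abs(x).sum() < tol:  # assumed stopping rule: small L1 norm
            break
    return total

def rwr_static(adj, seed_node, c):
    """RWR scores computed from scratch for one seed node."""
    trans = row_normalize(adj)
    q = np.zeros(trans.shape[0])
    q[seed_node] = 1.0
    return propagate(trans, c * q, c)

def rwr_update(r_old, adj_old, adj_new, c, tol=1e-12):
    """Update existing RWR scores after the graph changes."""
    a_old = row_normalize(adj_old)
    a_new = row_normalize(adj_new)
    # Offset seed: nonzero only where the changed rows send score.
    q_offset = (1.0 - c) * ((a_new - a_old).T @ r_old)
    # Propagate the offset on the updated graph, then merge with old scores.
    return r_old + propagate(a_new, q_offset, c, tol)

# Hypothetical 4-node directed graph; the update inserts the edge 1 -> 3.
adj_old = [[0, 1, 1, 0], [0, 0, 1, 0], [1, 0, 0, 1], [0, 0, 1, 0]]
adj_new = [[0, 1, 1, 0], [0, 0, 1, 1], [1, 0, 0, 1], [0, 0, 1, 0]]
c = 0.15
r_old = rwr_static(adj_old, seed_node=0, c=c)
r_new = rwr_update(r_old, adj_old, adj_new, c)
```

Under these assumptions the merged scores agree with a from-scratch computation on the updated graph, because subtracting the RWR equations for the old and new graphs leaves a linear system whose right-hand side is exactly `q_offset`. A larger `tol` stops the propagation earlier at the cost of accuracy, which is the kind of trade-off the abstract attributes to OSP-T.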
Similar resources
Fast and Exact Top-k Search for Random Walk with Restart
Graphs are fundamental data structures and have been employed for centuries to model real-world systems and phenomena. Random walk with restart (RWR) provides a good proximity score between two nodes in a graph, and it has been successfully used in many applications such as automatic image captioning, recommender systems, and link prediction. The goal of this work is t...
How to Explore a Fast-Changing World (Cover Time of a Simple Random Walk on Evolving Graphs)
Motivated by real-world networks and the use of algorithms based on random walks on these networks, we study simple random walks on dynamic undirected graphs with a fixed underlying vertex set, i.e., graphs which are modified by inserting or deleting edges at every step of the walk. We are interested in the expected time needed to visit all the vertices of such a dynamic graph, the cover time, und...
TPA: Two Phase Approximation for Random Walk with Restart
Given a large graph, how can we determine similarity between nodes in a fast and accurate way? Random walk with restart (RWR) is a popular measure for this purpose and has been exploited in numerous data mining applications including ranking, anomaly detection, link prediction, and community detection. However, previous methods for computing exact RWR require prohibitive storage sizes and compu...
READS: A Random Walk Approach for Efficient and Accurate Dynamic SimRank
Similarity among entities in graphs plays a key role in data analysis and mining. SimRank is a widely used and popular measurement to evaluate the similarity among the vertices. In real-life applications, graphs do not only grow in size, requiring fast and precise SimRank computation for large graphs, but also change and evolve continuously over time, demanding an efficient maintenance process ...
How to Explore a Fast-Changing World
Abstract. Motivated by real-world networks and the use of algorithms based on random walks on these networks, we study simple random walks on dynamic undirected graphs, i.e., graphs which are modified by inserting or deleting edges at every step of the walk. We are interested in the expected time needed to visit all the vertices of such a dynamic graph, the cover time, under the assumption that ...
Journal title:
- CoRR
Volume abs/1712.00595
Pages -
Publication date 2017